CDS
Accession Number | TCMCG075C12047 |
gbkey | CDS |
Protein Id | XP_017975580.1 |
Location | complement(join(1533392..1533712,1533816..1534026,1534288..1534427,1534814..1534895,1535213..1535529,1536361..1536447,1536550..1536710,1536925..1537078,1537187..1537306,1537864..1537974,1538254..1538430,1538799..1538944,1539159..1539345,1539524..1539661,1539819..1539990,1540072..1540262,1540773..1541018,1541159..1541506)) |
Gene | LOC18600905 |
GeneID | 18600905 |
Organism | Theobroma cacao |
Protein
Length | 1102aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_018120091.1 |
Definition | PREDICTED: uncharacterized protein LOC18600905 isoform X2 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
COG_category | U |
Description | isoform X1 |
KEGG_TC | - |
KEGG_Module | - |
KEGG_Reaction | - |
KEGG_rclass | - |
BRITE |
ko00000
[VIEW IN KEGG] ko00001 [VIEW IN KEGG] ko01000 [VIEW IN KEGG] ko03012 [VIEW IN KEGG] |
KEGG_ko |
ko:K03263
[VIEW IN KEGG] ko:K05294 [VIEW IN KEGG] |
EC | - |
KEGG_Pathway |
ko00563
[VIEW IN KEGG] ko01100 [VIEW IN KEGG] map00563 [VIEW IN KEGG] map01100 [VIEW IN KEGG] |
GOs | - |
Sequence
CDS: ATGTTGAATAATATGGAAAATGAGAGACGGAATAATAACAAAAATGAGGATGCGAGAATGCGAGGGTTTAGGCCTAGTTTGAGGGCTATGATGCTTGTGATTGCGGTGATATGGGTTGGTGTAGCTGCTTTGTATGGTTTGTTGAAGCCTGTATCGAACGGATGTATTATGACATATATGTATCCGACTTATATCCCGATTTCCACCAGAGAGGGCGTCTCATCTGTGAAGTACGGGTTGTATTTGTACCATGAAGGTTGGAGAAAGATTGATTTTAAGGAACACTTGAAGAACCTAAATGGAATTCCGGTTCTTTTTATTCCAGGCAATGGTGGCAGCTACAAACAGGTGCGCTCCTTGGCAGCTGAATCTGATAGAGCTTATCAAGGAGGTTCACTTGAACGTACATTCTATAGAGAAGCTTATCTAACTTCTGAGGAGGGAGGGAATGTGGATGTGGCTGACTTTCAATTACCCAACCGATATGCTAACAGGCTCGATTGGTTTGCTGTGGATCTTGAGGGTGAACATTCTGCAATGGATGGTCGGATACTCGAAGAGCACACTGAATATGTTGTATATGCTATTCATAGGATTTTGGATCAATACAAAGAATCCCGTGATGCTCGGAAAAGAGAGGGTGCTGCAACCACTGGTAGTTTGCCAAAAAGTGTCATATTGATTGGCCACTCTATGGGTGGTTTTGTTGCTAGAGCTGCAACTATCCACCCACATCTAAGGAAATCTGCAGTTGAGACTATTCTCACTCTTTCAAGCCCCCACCAATCACCTCCTGTGGCATTGCAACCATCCCTAGGTCATTACTATGAAAGTATAAATCAAGAATGGAAAAAGGGGTATGAGGTTCAAACCACTCAGACAGGGCATTATGTGTCTGGTCCAGCACTTTCTCATGTAGTTGTTGTTTCCATTTCTGGTGGTTATAATGATTATCAGGTACGCTCAAAATTAGAATCACTTGACAGTATTGTGCCCCCCACTCATGGATTTATGATAAGCAGTACGAGCATGAAAAATGTATGGCTATCTATGGAACATCAAGCTATTTTGTGGTGTAATCAACTAGTTGTGCAAGTGTCACATACTCTCCTTAGTTTGATAGACTCCAGAACAGGTCAGCCTTTGCCTGACACTCGACAAAGACTTGAAATATTTACAAGGATGCTTCGTAGTGGAATTCCGCAAAGTTTCAACTGGAAGATGCAATCACAGTCATCCTGGTCAACTCATGTTCCTGTGAAGGATGTAAAAGACACTGCTGGTTCCCAAGTGCATAACTTATTTGACTGTCCTAGCAGTGTCCATTGGAGTGATGATGGCCTTGAGAGGGATTTGTATATTCAGACAACAACCGTCACTGTTTTGGCCATGGATGGGAGAAGGCGGTGGTTGGACATAGAGAAATTGGGGTCCAATGGCAAAAGCCACTTCATATTTGTGACAAACCTTGCTCCTTGTTCTGGAGTCCGAATTCATCTCTGGCCTCAAAAGGGGAAATCATCTTCAGACTTGCCTGCTGGTAAAAGGGTTCTGGAAGTGACATCAAAGATGGTGCAAATTCCTGCAGGACCAGCACCAAGGCAGATTGAGCCTGGCAGTCAGACTGAGCAAGCACCTCCATCCGCGGTACTTCATTTGGGTCCTGAGGAAATGCATGGCTTCAGATTCCTGACTATCTCAGTTGCACCTCGTCCGACTATTTCAGGGAGGCCTCCGCCAGCCACTTCCATGGCAGTTGGGCAATTCTTTAATCCAGATGAAGGGGAGATAGAGTTCTCTCCTATATCGATGCTTCTGGCAACTCATTCGCATAAGGATGTATTGTTGAAGGAGGACCACCCACTTGCCTTCAATCTATCATTTGCAATTAGTTTAGGTCTTTTGCCTGTTACCTTCTCTTTGAAAACTGCTGGCTGTGGAATAAAAGATTCTGGGCTTCTTGATGAAGCTGGAGATTTGGAAAACACTAAGCTTTGCAAGCTGCGCTGTTTCCCACCTGTAGCACTTGCTTGGGATCCCACATCAGGTCTTCACGTATTTCCAAATTTGTACAGTGAGACTCTTGTTGTTGATTCCTCCCCAGCACTTTGGGCTTCGACTGGAACAGAGAAAACCACTGTTCTCTTACTGCTTGACCCACATTGTTCATATAAGGCAAGCATAGCTGTTTCTGTAACTCCAGCGGCCAGCAGATTTTTGCTTCTATATAGTTCGCAGATAGTTGGGTTCTCTGTTGCTGTTATACTTTTTGCTCTGATGCGACAAGCACATGCAAGGCCAATTCCTTCTATACTGAAAGCTGTGGAGTCCAACCTAAAAATACCATTCCCATTTTTGCCTTTTGCTGTAGTACCCATTTTGGTTTCCTTGTTCTTTTCCTTTCTAACATCTCAACCATTTCCTCCATTCTTTAGCTTCACCATTGTGTCAATGATTTGCTACCTATTTGCAAATGGGTTTGTAATTCTACTGATATTAGTTTCCCAGTTGGTCTTCTATGTGGCTGCCTCTATACATGTTCTCATAAAGAGGAGGTGGCAACTATGGGAAGGAAATTTTTGCTTTTTATTTCTGCAATGGTTTATGAATCTTTCTTCCAAGTTCTTTTCATTAAAGGTGGTAAGGGTTCTAAGAGCCAATCCATTATTCATTCCAATATCAGCAGCAATTGTTTTGTCTACATTTGTACATCCAGCACTTGGCCTATTCATACTGATCTTGTCTCATGCTTTGTGTTGTCATAGTTCGCTGTGCAACCATGCAAGGAAAAAGGAATTGTCTGATTGCAAAGGTGAAGGCAATTATTTGTCTCAGCAGTTTGCATCCAAACCTGGTTCCCCTTCTAAAGAAAACAGCTCCAGTTATGGTCAGACACAAGAGGATACCTTCCACCACCGGCATGGCTTACTGATGCTTCATCTTCTTGCAGCACTAATGTTTGTTCCCTCTCTCGTTTCTTGGTTGCAGAGAATAGGGATGCATCAGAGCTTTCCAAGGTTCCTGGATTCATTCCTTTGCATTTGTTTGATCCTTCATGGTATCTTTAGTTCAGAGTCGTTGCTAAGTTCCTCGTTGCCCTTTCCACGCATCCTGGGTCAGGAAGTGAGACTGAATTTCGTCTACCTAATTGCCGGAATGTACTCCTATTTATCTGGTCTGGCTTTGGAACCTTATAAAGTGTTTTATGCCATGGGTGCCGTTGGGATCGTATCCTTTGCATTGAGTATCTTACAGGTATGGACAGGAGCACCGCGGTTCGGAAGAAGACGGCATTGGCACAGACACTAG |
Protein: MLNNMENERRNNNKNEDARMRGFRPSLRAMMLVIAVIWVGVAALYGLLKPVSNGCIMTYMYPTYIPISTREGVSSVKYGLYLYHEGWRKIDFKEHLKNLNGIPVLFIPGNGGSYKQVRSLAAESDRAYQGGSLERTFYREAYLTSEEGGNVDVADFQLPNRYANRLDWFAVDLEGEHSAMDGRILEEHTEYVVYAIHRILDQYKESRDARKREGAATTGSLPKSVILIGHSMGGFVARAATIHPHLRKSAVETILTLSSPHQSPPVALQPSLGHYYESINQEWKKGYEVQTTQTGHYVSGPALSHVVVVSISGGYNDYQVRSKLESLDSIVPPTHGFMISSTSMKNVWLSMEHQAILWCNQLVVQVSHTLLSLIDSRTGQPLPDTRQRLEIFTRMLRSGIPQSFNWKMQSQSSWSTHVPVKDVKDTAGSQVHNLFDCPSSVHWSDDGLERDLYIQTTTVTVLAMDGRRRWLDIEKLGSNGKSHFIFVTNLAPCSGVRIHLWPQKGKSSSDLPAGKRVLEVTSKMVQIPAGPAPRQIEPGSQTEQAPPSAVLHLGPEEMHGFRFLTISVAPRPTISGRPPPATSMAVGQFFNPDEGEIEFSPISMLLATHSHKDVLLKEDHPLAFNLSFAISLGLLPVTFSLKTAGCGIKDSGLLDEAGDLENTKLCKLRCFPPVALAWDPTSGLHVFPNLYSETLVVDSSPALWASTGTEKTTVLLLLDPHCSYKASIAVSVTPAASRFLLLYSSQIVGFSVAVILFALMRQAHARPIPSILKAVESNLKIPFPFLPFAVVPILVSLFFSFLTSQPFPPFFSFTIVSMICYLFANGFVILLILVSQLVFYVAASIHVLIKRRWQLWEGNFCFLFLQWFMNLSSKFFSLKVVRVLRANPLFIPISAAIVLSTFVHPALGLFILILSHALCCHSSLCNHARKKELSDCKGEGNYLSQQFASKPGSPSKENSSSYGQTQEDTFHHRHGLLMLHLLAALMFVPSLVSWLQRIGMHQSFPRFLDSFLCICLILHGIFSSESLLSSSLPFPRILGQEVRLNFVYLIAGMYSYLSGLALEPYKVFYAMGAVGIVSFALSILQVWTGAPRFGRRRHWHRH |